Student, Text and Curriculum Modeling for Reader-specific Document Retrieval

نویسندگان

  • Jonathan C. Brown
  • Maxine Eskenazi
چکیده

In today's language-learning classrooms, all of the students in a class almost always have the same text to read. Although students have different reading levels, it is impractical for a single teacher to find unique texts matched to each student's abilities. The REAP system was developed to make the process of providing students with individualized texts practical. The texts come in the form of authentic documents retrieved from the Web, and the system tracks and assesses students’ knowledge as they use the system. The system is able to find documents that meet various and individualized criteria. In this paper, we describe our work on modeling lexical familiarity. In particular, we detail the approaches taken for modeling the student's vocabulary knowledge, the contents of documents in the corpus, and the components of the curriculum. We also address related work and future plans.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Document Embedding Method for News Classification

Abstract- Text classification is one of the main tasks of natural language processing (NLP). In this task, documents are classified into pre-defined categories. There is lots of news spreading on the web. A text classifier can categorize news automatically and this facilitates and accelerates access to the news. The first step in text classification is to represent documents in a suitable way t...

متن کامل

Document Image Dewarping Based on Text Line Detection and Surface Modeling (RESEARCH NOTE)

Document images produced by scanner or digital camera, usually suffer from geometric and photometric distortions. Both of them deteriorate the performance of OCR systems. In this paper, we present a novel method to compensate for undesirable geometric distortions aiming to improve OCR results. Our methodology is based on finding text lines by dynamic local connectivity map and then applying a l...

متن کامل

A Joint Semantic Vector Representation Model for Text Clustering and Classification

Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...

متن کامل

Improving Keyphrase Extraction from Biomedical Documents Using Domain Specific Feature Set

Keyphrases enable the reader to quickly determine whether the given article is suitable for the reader’s digest. Keyphrases are also important for medical document retrieval and text mining research. Sometimes, the author-assigned Keyphrases or keywords available with the articles are too limited to represent the topical content of the articles. Many medical documents also do not come with auth...

متن کامل

Retrieval of Authentic Documents for Reader-Specific Lexical Practice

When a teacher gives a reading assignment in today’s language learning classrooms, all of the students are almost always reading the same text. Although students have different reading levels, it is impractical for a single teacher to seek out unique texts matched to each student’s abilities. In this paper, we describe REAP, a system designed to assign each student individualized readings by co...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005